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DETAILED ACTION 

Claim Rejections - 35 USC § 102 

1 . The following is a quotation of the appropriate paragraphs of 35 U.S.C. 1 02 that 
form the basis for the rejections under this section made in this Office action: 

A person shall be entitled to a patent unless - 

(e) the invention was described in (1) an application for patent, published under section 122(b). by 
another filed in the United States before the invention by the applicant for patent or (2) a patent 
granted on an application for patent by another filed in the United States before the invention by the 
applicant for patent, except that an international application filed under the treaty defined in section 
351 (a) shall have the effects for purposes of this subsection of an application filed in the United States 
only if the international application designated the United States and was published under Article 21(2) 
of such treaty in the English language. 

2. Claims 1-63 are rejected under 35 U.S.C. 102(e) as being anticipated by 
Minamino et a!., (7,013,277). 

As per claim 1, Minamino et al., teach a program product for speech recognition, 
comprising machine-readable program code for causing, when executed, a machine to 
perform the following method: detecting at least one speech element in an utterance of 
acoustic data which meets a criteria for potentially being reduced; presenting at least a 
portion of the utterance of acoustic data to a user with a prompt to determine if the 
speech element is reduced; and if verification data is received from the user that the 
speech element in the utterance is reduced, training an acoustic model for a reduced 
form of the speech element using data related to the utterance (Col.21, line 39 - Col.22, 
line 47, Col. 19, lines 1-60). 
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As per claim 2, Minamino et al., teach the program product as defined in claim 1, 
wherein the criteria is that a portion of the acoustic data of the utterance matches data 
in a dictionary of reduced speech elements (Col. 16, lines 14 - 65). 

As per claim 3, Minamino et al., teach the program product as defined in claim 2, 
wherein the matched data in the dictionary comprises acoustic data for a reduced 
speech element (Col. 16, lines 14 - 65). 

As per claim 4, Minamino et al., teach the program product as defined in claim 2, 
wherein the matched data in the dictionary comprises word sequence context data 
(Col. 16, lines 14-65). 

As per claim 5, Minamino et al., teach the program product as defined in claim 2, 
wherein the dictionary is external to a speech recognition model in a base speech 
recognition process (Col. 16, lines 14 - 65). 

As per claim 6, Minamino et al., teach the program product as defined in claim 1, 
wherein said acoustic model for said reduced form is a discriminative model 
distinguishing said reduced form from the corresponding unreduced form (Col. 16, lines 
14-65). 

As per claim 7, Minamino et al., teach the program product as defined in claim 1, 
wherein the criterion is that the speech element has a substantially lower amplitude than 
in an unreduced model for that speech element (Col.16, lines 14 - 65). 
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As per claim 8, IVIinamino et al., teacli the program product as defined in claim 1, 
wherein the criterion is that the speech element has a substantially shorter duration than 
an average duration for the speech element (Col. 16, lines 14 - 65). 

As per claim 9, Minamino et al., teach the program product as defined in claim 1, 
wherein the criterion is that the speech element has acoustic characteristics that are . 
less extreme than an unreduced model of the speech element (Col.16, lines 14 - 65). 

As per claim 10, Minamino et al., teach the program product as defined in claim 
1 , wherein the criterion is that the speech element has acoustic characteristics that are 
more like an average speech sound than is an unreduced model for the speech element 
(Col.16. lines 14-65). 

As per claim 1 1 , Minamino et al., teach the program product as defined in claim 
1 , wherein the speech element is a vocalic and the criterion is that the speech element 
has acoustic characteristics more similar to a uniform acoustic tube than does an 
unreduced model for the speech element (Col.16, lines 14 - 65). 

As per claim 12, Minamino et al., teach the program product as defined in claim 
l', wherein the criterion is that the speech element has acoustic characteristics 
associated with an incomplete articulatory gesture (Col.16, lines 14 - 65). 

As per claim 13, Minamino et al., teach the program product as defined in claim 
1, wherein the unreduced speech element is formed by closure between the tongue and 
the roof of the mouth and the criterion is that the speech element has acoustic 
characteristics associated with only an incomplete or brief contact of the tongue with the 
roof of the mouth (Col. 16, lines 1 4 - 65). 
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As per claim 14, i\/linamino et al., teach a program product for speech 
recognition, comprising machine-readable program code for causing, when executed, a 
machine to perform the following method: receiving a training utterance of acoustic data 
of a word sequence; detecting if the training utterance of acoustic data has at least one 
speech element which meets a criterion for potentially being reduced; presenting the 
utterance of acoustic data to a user with a prompt to determine if the speech element is 
reduced; and if verification data is received from the user that the speech element in the 
utterance is reduced, then associating a reduced designation with the speech element 
in the training utterance designated as reduced (Col.21, line 39 - Col.22, line 47, 
Col. 19. lines 1-60). 

As per claim 15, Minamino et al., teach the program product as defined in claim 
14, wherein the criterion is that the speech element has a substantially lower amplitude 
than in an unreduced model for that speech element (Col. 16, lines 14 - 65). 

As per claim 16, Minamino et al., teach the program product as defined in claim 
14, wherein the criterion is that the speech element has a substantially shorter duration 
than an average duration for speech element (Col. 16, lines 14 - 65). 

As per claim 17, Minamino et al., teach the program product as defined in claim 
14, wherein the criterion is that the speech element has acoustic characteristics that are 
less extreme than an unreduced model for the speech element (Col. 16, lines 14 - 65). 

As per claim 18, Minamino et al., teach the program product as defined in claim 
14, wherein the criterion is that the speech element has acoustic characteristics that are 
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more like an average speech sound than is an unreduced model for the speech element 
(Col. 16, lines 14 -65). 

As per claim 19, Minamino et al., teach the program product as defined in claim 
14, wherein the speech element is a vocalic and the criterion is that the speech element 
has acoustic characteristics more similar to a uniform acoustic tube than does an 
unreduced model for the speech element (Col. 16, lines 14 - 65). 

As per claim 20, Minamino et al., teach the program product as defined in claim 
14, wherein the criterion is that the speech element has acoustic characteristics 
associated with an incomplete articulatory gesture (Col. 16, lines 14 - 65). 

As per claim 21, Minamino et al., teach the program product as defined in claim 
14, wherein the unreduced speech element is formed by closure between the tongue 
and the roof of the mouth and the criterion is that the speech element has acoustic 
characteristics associated with only an incomplete or brief contact of the tongue with the 
roof of the mouth (Col.16, lines 14 - 65). 

As per claim 22, Minamino et al., teach a speech recognition process, 
comprising: detecting at least one speech element in an utterance of acoustic data 
which meets a criteria for potentially being reduced; presenting at least a portion of the 
utterance of acoustic data to a user with a prompt to determine if the speech element is 
reduced; and if verification data is received from the user that the speech element in the 
utterance is reduced, training a discrimination model using data related to the utterance 
(Col.21, line 39-Col.22, line 47, Col. 19, lines 1-60). 
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As per claim 23, Minamino et al., teach the method as defined in claim 22, 
wherein the criteria is that a portion of the acoustic data of the utterance matches data 
in a dictionary of reduced speech elements (Col. 16, lines 14 - 65). 

As per claim 24, Minamino et al., teach the method as defined in claim 23, 
wherein the matched data in the dictionary comprises acoustic data for a reduced 
speech element (Col. 16, lines 14 ~ 65). 

As per claim 25, Minamino et al., teach the method as defined in claim 23, 
wherein the matched data in the dictionary comprises word sequence context data. 

As per claim 26, Minamino et al., teach the method as defined in claim 22, 
wherein the dictionary is external to a speech recognition model in a base speech 
recognition process (Col. 16, lines 14 - 65). 

As per claim 27, Minamino et al., teach the method as defined in claim 22, 
wherein said acoustic model for said reduced form is a discriminative model 
distinguishing said reduced form from the corresponding unreduced form (Col. 16, lines 
14-65). 

As per claim 28, Minamino et al., teach the method as defined in claim 22, 
wherein the criterion is that the speech element has a substantially lower amplitude than 
in an unreduced model for that speech element (Col. 16. line 14 - C0LI8, line 65). 

As per claim 29, Minamino et al., teach the method as defined in claim 22. 
wherein the criterion is that the speech element has a substantially shorter duration than 
an average duration for the speech element (Col. 16, line 14 - Col. 18, line 65). 
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As per claim 30, Minamino et al., teach the method as defined in claim 22, 
wherein the criterion is that the speech element has acoustic characteristics that are 
less extreme than an unreduced model of the speech element (Col. 16, line 14 - Col. 18, 
line 65). As per claim 31 , Minamino et al., teach the method as defined in claim 22, 
wherein the criterion is that the speech element has acoustic characteristics that are 
more like an average speech sound than is an unreduced model for the speech element 
(Col.16, line .14 - Col.18, line 65). 

As per claim 32, Minamino et al., teach the method as defined in claim 22, 
wherein the speech element is a vocalic and the criterion is that the speech element has 
acoustic characteristics more similar to a uniform acoustic tube than does an unreduced 
model for the speech element (Col.16, line 14 - Col.18, line 65). 

As per claim 33, Minamino et al., teach the method as defined in claim 22, 
wherein the criterion is that the speech element has acoustic characteristics associated 
with an incomplete articulatory gesture (Col.16, line 14 - Col.18, line 65). 

As per claim 34, Minamino et a!., teach the method as defined in claim 22, 
wherein the unreduced speech element is formed by closure between the tongue and 
the roof of the mouth and the criterion is that the speech element has acoustic 
characteristics associated with only an incomplete or brief contact of the tongue with the 
roof of the' mouth (Col.16, line 14 -Col.18, line 65). 

As per claim 35, Minamino et al., teach a speech recognition method, 
comprising: receiving a training utterance of acoustic data of a word sequence; 
detecting if the training utterance of acoustic data has at least one speech element 
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which meets a criterion for potentially being reduced; presenting the utterance of 
acoustic data to a user with a prompt to detemiine if the speech element is reduced; 
and if verification data is received from the user that the speech element in the 
utterance is reduced, then associating a reduced designation with the speech element 
in the training utterance designated as reduced (Col.21, line 39 - Col.22, line 47, 
Col. 19, lines 1-60). 

As per claim 36, Minamino et al., teach the method as defined in claim 35, 
wherein the criterion is that the speech element has a substantially lower amplitude than 
in an unreduced model for that speech element (Col. 16, line 14 - Col. 18, line 65). 

As per claim 37, Minamino et al., teach the method as defined in claim 35, 
wherein the criterion is that the speech element has a substantially shorter duration than 
an average duration for speech element (Col.16, line 14 - Col. 18, line 65). 

As per claim 38, Minamino et al., teach the method as defined in claim 35, 
wherein the criterion is that the speech element has acoustic characteristics that are 
less extreme than an unreduced model for the speech element (Col.16, line 14 - 
Col.18, line 65). 

As per claim 39, Minamino et al., teach the method as defined in claim 35, 
wherein the criterion is that the speech element has acoustic characteristics that are 
more like an average speech sound than is an unreduced model for the speech element 
(Col.16, line 14 - Col.18, line 65). 

As per claim 40, Minamino et al., teach the method as defined in claim 35, 
wherein the speech element is a vocalic and the criterion is that the speech element has 
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acoustic characteristics more similar to a uniform acoustic tube than does an unreduced 
model for the speech element (Col. 16, line 14 - Col. 18, line 65). 

As per claim 41, Minamino et al., teach the method as defined in claim 35, 
wherein the criterion is that the speech element has acoustic characteristics associated 
with an incomplete articulatory gesture (Col. 16, line 14 - Col. 18, line 65). 

As per claim 42, Minamino et al., teach the method as defined in claim 35, 
wherein the unreduced speech element is formed by closure between the tongue and 
the roof of the mouth and the criterion is that the speech element has acoustic 
characteristics associated with only an incomplete or brief contact of the tongue with the 
roof of the mouth (Col. 16, line 14 - Col. 18, line 65). 

As per claim 43, Minamino et al., teach a speech recognition system, comprising: 
a detector for receiving at least one speech element in an utterance of acoustic data 
which meets a criteria for potentially being reduced; a presentation device for presenting 
at least a portion of the utterance of acoustic data to a user with a prompt to determine if 
the speech element is reduced; and a computer training device that, if verification data 
is received from the user that the speech element in the utterance is reduced, trains an 
acoustic model for a reduced form of the speech element using data related to the 
utterance (Col.11. line 63 -Col. 13, line 53). 

As per claim 44, Minamino et a!., teach the system as defined in claim 43, 
wherein the criteria is that a portion of the acoustic data of the utterance matches data 
in a dictionary of reduced speech elements (Col. 16, line 14 - Col. 18, line 65). 
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As per claim 45, Minamino et a!., teach the system as defined in claim 44, 
Wherein the matched data in the dictionary comprises acoustic data for a reduced 
speech element (Col.16, line 14 - Col.18, line 65). 

As per claim 46, Minamino et al., teach the system as defined in claim 44, 
wherein the matched data in the dictionary comprises word sequence context data. 

As per claim 47, Minamino et al., teach the system as defined in claim 44, 
wherein the dictionary Is external to a speech recognition model in a base speech 
recognition process (Col.16, line 14 - Col.18, line 65). 

As per claim 48, Minamino et al., teach the system as defined in claim 43, 
wherein said acoustic model for said reduced form is a discriminative model 
distinguishing said reduced form from the corresponding unreduced form (Col.16, line 
14 -Col.18, line 65). 

As per claim 49, Minamino et al., teach the system as defined in claim 43, 
wherein the criterion Is that the speech element has a substantially lower amplitude than 
in an unreduced model for that speech element (Col.16, line 14 - Col.18, line 65). 

As per claim 50, Minamino et al., teach the system as defined in claim 43, 
wherein the criterion is that the speech element has a substantially shorter duration than 
an average duration for the speech element (Col.16, line 14 - Col.18, line 65). 

As per claim 51, Minamino et al., teach the system as defined in claim 43, 
wherein the criterion is that the speech element has acoustic characteristics that are 
less extreme than an unreduced model of the speech element (Col.16, line 14 - Col.18, 
line 65). 
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As per claim 52, Minamino et al., teach the system as defined in claim 43, 
wherein the criterion is that the speech element has acoustic characteristics that are 
more like an average speech sound than is an unreduced model for the speech element 
(Col.16, line 14 - Col.18, line 65). 

As per claim 53, Minamino et a!., teach the system as defined in claim 43, 
wherein the speech element is a vocalic and the criterion is that the speech element has 
acoustic characteristics more similar to a uniform acoustic tube than does. an unreduced 
model for the speech element (Col.16, line 14 - Col.18, line 65). 

As per claim 54, Minamino et al., teach the system as defined in claim 43, 
wherein the criterion is that the speech element has acoustic characteristics associated 
with an incomplete articulatory gesture (Col.16, line 14 - Col.18, line 65). 

As per claim 55, Minamino et al., teach the system as defined in claim 43, 
wherein the unreduced speech element is formed by closure between the tongue and 
the roof of the mouth and the criterion is that the speech element has acoustic 
characteristics associated with only an incomplete or brief contact of the tongue with the 
roof of the mouth (Col.16, line 14 -Col.18, line 65). 

As per claim 56, Minamino et al., teach a speech recognition system, comprising: 
a receiver for receiving a training utterance of acoustic data of a word sequence; a 
detector for detecting if the training utterance of acoustic data has at least one speech 
element which meets a criterion for potentially being reduced; a presentation device for 
presenting the utterance of acoustic data to a user with a prompt to determine if the 
speech element is reduced; and logic for, if verification data is received from the user 
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that the speech element in the utterance is reduced, then associating a reduced 
designation with the speech element in the training utterance designated as reduced 
(Col. 1 1 , line 63 - Col. 1 3, line 53). 

As per claim 57, Minamino et al., teach the system as defined in claim 56, 
wherein the criterion is that the speech element has a substantially lower amplitude than 
in an unreduced model for that speech element (Col. 16, line 14 - Col. 18, line 65). 

As per claim 58, Minamino et al., teach the system as defined in claim 56, 
wherein the criterion is that the speech element has a substantially shorter duration than 
an average duration for speech element. 

As per claim 59, Minamino et al., teach the system as defined in claim 56, 
wherein the criterion is that the speech element has acoustic characteristics that are 
less extreme than an unreduced model for the speech element (Col. 16, line 14 - 
Col. 18, line 65). 

As per claim 60, Minamino et al., teach the system as defined in claim 56, 
wherein the criterion is that the speech element has acoustic characteristics that are 
more like an average speech sound than is an unreduced model for the speech element 
(Col.16, line 14 - Col.18, line 65). 

As per claim 61, Minamino et al., teach the system as defined in claim 56, 
wherein the speech element is a vocalic and the criterion is that the speech element has 
acoustic characteristics more similar to a unifomi acoustic tube than does an unreduced 
model for the speech element (Col.16, line 14 - Col.18, line 65). 
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As per claim 62, IVIinamino et al., teacli the system as defined In claim 56, 
wherein the criterion is that the speech element has acoustic characteristics associated 
with an incomplete articulatory gesture (Col. 16, line 14 - Col. 18, line 65). 

As per claim 63, Minamino et al., teach the system as defined in claim 56, 
wherein the unreduced speech element is formed by closure between the tongue and 
the roof of the mouth and the criterion is that the speech element has acoustic 
characteristics associated with only an incomplete or brief contact of the tongue with the 
roof of the mouth (Col.16, line 14 - Col.18, line 65). 

Conclusion 



3. The prior art made of record and not relied upon is considered pertinent to 
applicant's disclosure. 

Dahan et al., (6,018,708) teach a method and apparatus for performing speech 

recognition utilizing a supplementary lexicon orthographies. 

DeSimone (5,539,861) teaches speech recognition using bio-signals. 

Yamada (6,832,192) teaches a speech synthesizing method and apparatus. 

Gould et al., (5,920,837) teach a word recognition system which stores two models for 

some words and allows selective deletion of one such model. 

Bennett et al., (6,633,846) teach a distributed realtime speech recognition system. 

Even et al., (7,085,716) teach speech recognition using word-in-phrase command. 
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Takagi (5,819,223) teaches speech adaptation device suitable for speech recognition 
device and word spotting device. 

Gould et al., (5,915,236) word recognition system which alters code executed as a 
function of available computational resources. 

Bah! et a!., (5,884,259) teach a method and apparatus for a time synchronous tree- 
based search strategy. 

Chou et al., (5,797,123) teach a method of key-phrase detection and verification for 
flexible speech understanding. 

Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Vijay B. Chawan whose telephone number is (571) 272- 
7601 . The examiner can normally be reached on Monday Through Friday 6:30-3:00. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Richemond Dorvil can be reached on (571) 272-7602. The fax phone 
number for the organization where this application or proceeding is assigned is (571) 
273-8300. 
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Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status infomiation for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). 




vijay &/Chawan 
Primary Examiner 
Art Unit 2626 



vbc 
5/8/07 



PRiMARy EXAMINER 



